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DUPLICATE 

Dfcsatcorase Enzymes 

The invention relates to transgenic cells transformed with nucleic acid molecules 
which encode enzymes with desaturase activity_and the use of these cells end 
5 emytties inbiocatalysis. 

Desaturases axe ecozymes involved in the synthesis of long ch^m polyunsaturated fatty- 
acids (FUFAs). PUFAs are fetty acids CPAs) which are essential to the norma! 
fimctioiung of a cell and their nutot*o*ial properties are -well known. An example of 

10 a PUPA is docosahexanoic acid (DBA). DHA is a n-3 fatty acid that can be obtained 
directly from the diet or derived from metabolism of dietary linoleic and a-linolenic . 
acid. The n-3 fatty acids are associated with health promoting properties. For 
example n-3 fatty acids have been described as anti-inflammffory 9 antithrombotic, 
antiarrhyflxoaic 5 hypolipidemic and vasodilatory. As such, the role of DHA in the 

15 prevention and/or treatment of diseases such as coronary heart disease, hypertension, 
type n diabetes, ocular diseases, arthritis, cystic Sbiosis and schizophrenia has beeti " 
the focus of a great deal of medical research. 

The production of PUFAs involves a consecutive series of desaturations and 
20 elongations of the fatty acyl chain to generate arachidonic acid (20:4^5,8,1 1 ? 14) and 
docosahexaenoic acid (22:6A4 !I 7 ? 10 ? 13 3 16 J> 19)- Several desaturases involved in this 
metabolic process have been isolated from marine microalgae, including 
Phaeodaciylum tricornututn [5] ? Euglena gracilis [€] and Pavlova lutheri [7], These 
meriitbrane-boiind desatuiases are specific with respect to both chain length of the 
25 substrate and the double bond positions on the fatty acidL They belong to the class 
known as front-end fatty acid desatoases due to the fact that they introduce double 
. bonds between the carboxy-gtoup and pre-existing bond(s) of the fatty acid [1], 
These desaturases contain a cytochrome bS domain at their N-4erminus and three 
histldiTae motifs that are important for catalytic activity [10]. 

30 
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Desaturase enzymes and the gea.es which encode them ar© known in. the art. For 
example, WO03/064596 describes, amongst* other things, transgenic cells 

transformed with omega 3 and delta 12 desaturase twicleic add molecules and the use 

Of Ifeese cells in the production of fatty acids, fii particular the use of the omega 3 
5 desaturase in the conversion of arachidonic acid to eicosapentaenoie acid and the use 

of the delta 12 desaturase in the conversion of oleic acid to linoleie acid. 

WO03/099216 also describes fungal desaturases and in particular transgenic plant? 

modified to express fungal delta 1 5 desaturase enzymes, 

10 Furthermore, US2003/0157144 and US2003/0 167525 disclose delta 5 and delta 6 
desaturase gegaes in the conversion of dihomoylinoleiric acid to araehidonic acid and 
linoleic acid to y-linolenic acid respectively. Moreover, TJS2003/134400 discloses 
delta 4 desatease genes which are involved in the conversion of adrenic acid to 006- 
docosapeirtaenoic acid and in the conversion of c>3- docosapentaenoic acid to 

15 doeosahexaenoic acid. These, rare fatty acids are used in pharmacutical and cosmetic 

Besides the common FAs 16:0, 16:1A5>, 1S:0 mud 18:1A9 found in most living 
Ofgamsxns, trace amounts of more unusual fatty acids can be found in a wide range of 

20 species. For instance, presence of 16:1 All has been reported in several species of 
Pavlova, in the EustjLgmatophyte Nartnochloropsis oculata, and in the diatoms 
Pkaeodactylum tricornutum and Thalassiosira pseudonana [11 5 12 ? 13]. This FA 
accounted for a very small portion of the total FAs in these microalgae, and its 
specific role in the algal cells is unknown. However, this FA is a v«ry important 

25 precursor in the synthesis of sex pheromones in insects, Sex pheromones are species- 
specific blends of unsaturated fatty acid (UFA) derivatives that differ in terminal 
ftmctional group and in the number, position and configuration (Z or E) of the double 
bond(s), which are produced by various acyl-CoA desaturase? [14,15]. Simple 
monoene Al 1 UFAs are the most prevalent precursors in the formation of major sex 

30 pheramome components in the modern Lepidopiera [1 6471- For instance, in the com 
earworm Helicoverpa zea, which produces a pheromotte mixture of Zll-16;Ald and 
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Z9-16:Ald in a 30il ratio ? the most abundant desatasse-encodiftg transcript is 
HpeaLPAQ (also called HzPGDsT) which encodes a All-desaturase that does not 
possess a cytochrome b5 extension, and therefor© requires Jiee cytochrome b5 for 
activity- Many acyl-CoA AU-desaturases with different specificities have been 
5 isolated from insects [14,15], "but none from other species. We describe the fitst 
chai^acterisation of a cytochrome £5 desaturase exhibiting Al 1 -desaturase activity. 

Accoiding to an aspect of the invention there is provided a transgenic cell comprising 
a nucleic aoid molecule which comprises a nucleic acid sequence which nucleic acid 
10 molecule consists of the sequences as represented in Figures 5a, 5b, 6a, 7a 3 Sa, 8b 3 
9a, 10a„ 11a, lib ? or nucleic acid molecules which hybridise to these sequences, 
wheSrtsin said nucleic acid molecules encode a polypeptide which has desaturase 
activity, 

15 In a preferred embodiment of the invention said hybridisation conditions axe stringent 
hybridisation conditions. 

la a preferred embodiment of the invention said nucleic acid molecule comprises a 
nucleic acid sequence which has at least 30% homology to the nucleic acid sequence 
20 represented in Figures 5a* 5b, 6a, 7a, Sa, Sb ? 9a, 10a, 11a, lib. Preferably said 
homoIogyisatleast4Q%,50%,60%,70%, 80%, 90%, or at least 99% identity with 
the nucleic acid sequence represented in Figures 5a, 5b ? 6a> 7^ 8a„ 8b, 9a, 10a, 11a, 
1 lb Slid which encodes a polypeptide which has desaturase activity. 

25 la a preferred embodiment of the invention said nucleic acid molecule comprises the 
nucleic acid sequence as represented in Figures 5a, 5b, 6a, 7a ? 8a* 8b s 9a, 10a, 11a, 
lib. Preferably said nucleic acid molecule consists of the nucleic acid sequence as 
represented inFigures 5a 3 5b ? 6a, 7a, 8a 3 8b, 9a, 10a, 1 la, 1 lb. 

30 in a further preferred embodiment of tihs invention said cell over-expresses said 
desaturase encoded by said nucleic acid molecule, 



17. FEB, 2004 1 6:05 HARRISON GODDARD 



NO. 017 P. 9 




M a preferred embodiment of the invention said over-expression is at least 2-fold 
higher when compared to a non-transformed reference cell of the same species. 

5 Preferably $aid over-expression is: at least 3 -fold. 4-foid 3 5-fold, 6-fold, 7-feld, 8- 
fold, 9-fold, or at least 10-fold when compared to a non-transformed reference cell 
of the same species, 

lii a preferred embodiment of the invention said nucleic acid molecule is a cDNA. 

10 

In. yet a further preferred embodiment of the invention said nucleic acid molecule is a 
genomic DNA. 

Ixk a preferred embodiment of the invention said transgenic cell is traasfected with a 
15 nucleic acid molecule comprising a nucleic acid sequence as represented by Figure 
10a and -which encodes a desatarase polypeptide wherein said polypeptide has A 1 1- 
desatuxase activity, or a nucleic acid molecule which hybridises to the nucleic acid 
molecule in Figure 10a and encodes a polypeptide with Al 1-desataase activity. 

20 In an alternative preferred embodiment of the invention said transgenic cell is 
transfected with a nucleic acid molecule comprising a nucleic acid sequence as 
represented by Figure 8a and which encodes a desatuxase polypeptide wherein said 
polypeptide has A6-desaturase activity, or a nucleic acid molecule which hybridises 
to the nucleic acid molecule in Figure Sa and encodes a polypeptide with A6- 

25 desafurase activity. 

In a preferred embodiment of foe invention said transgenic cell is a eukaryotic cell. 
In an alternative preferred embodiment of the invention said coll is a prokaryotic cell. 
In a further preferred embodiment of the invention said eufcaryotic cell is a plant cell, 



4 
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Plaats Which include a plant cell according to the invention are also provided as are 
seeds produced by said plants- 

5 la a preferred embodiment of the invention said plant is selected from: com (Zea 
may$% canola (JSrassica napus t Brassica rapa ssp.) ? flax (Linum mitatissimum), 
alfalfa (Msdicagg ^atiya), tafia (Qrysa sativa), rye (Secale cerate), sorghum (Sorghum 
bicolor > Sorghum vidgare), sunflower {Helianthus armus\ wheat (Tritium aesttvum\ 
soybean {Glycine. mox% tobacco (Nicotiana tabacum), potato (Solatium tuberosum), 

10 peanuts (Arachis hypogaea\ cottpn (Gossypmm hirsuium}, sweet potato (lopmoea 
batatus) 9 cassava (Manihot esculent^ coffee (Cofea spp,), coconut (Cocos nucifera% 
pineapple (Anana comosii^), citris tree (Citrus spp.) cocoa (Tfteobroma cacao), tea 
{CemeUia sinensis)* banana (Musa spp.) ? avacado (Persea americana), fig (Ficus 
casica), guava (Pgidium guajava% mango (Mangifer indica% olive (Olea europaed), 

15 papaya (Canca papaya), cashew (Anacardium occidental), maoadamia (Macadamis. 
iritergrifelia% Mmond (Pnmus amygdalus), sugar beets (.Seta vulgaris), bats, barley, 
vegetables and ornamentals. 

Preferably, plants of the present invention are crop plants (for example, cereals and 
20 pulses, maize, wheats potatoes, tapioca, rice, sorghum, millet, cassava, barley, pea), 
and other root, tuber or seed crops. Important seed crops are oil-seed rape, sugar 
beet, maize, sunflower, soybeao.sorghum, and flax (linseed). Horticultural plants to 
which the present invention may be applied may include lettuce, endive, and 
vegetable brassicas including cabbage, broccoli, and cauliflower.* The present 
25 invention may be applied in tobacco, cucurbits, carrot, strawberry, sunflower, tomato, 
pepper. 

Grain plants that provide seeds of interest include oil-seed plaats and leguminous 
plants. Seeds of interest include grain seeds, such as com, wheat, barley, rice, 
30 sor,ghum, rye, etc. Oil seed plants include cotton, soybean, safflower, sunflower, 
Brassica, mai ze, alfalfa, palm, coconut , etc. Leguminous plants inc lude beans and 
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peas, Beans imchiite guar, locUtt ^ .feragisek, wybm P«fe ^ 
mungbean, lima bean, favabeen, lentils, chickpea, etc. 

According to a farther aspect of the invention there is provided a seed comprising a 
5 . plant cell according to the invention. Preferably said seed is from an oil seed plant- 

According to a yet farther aspect of the invention tbete is provided a reaction vessel 
comprising at least one polypeptide according to the invention, fatty acid substrates 
and co-factors wherein said vessel is adapted for the desaturation of said fatty acids 
10 substrates. 

In a preferred embodiment of the invention said polypeptide is expressed by a cell 
according to the invention. 

15 Preferably said cell is a eukatyotic cell, for sample a yeast cell. 

In an alternative preferred embodiment of the invention said cell is a prokaryotic cell. 

According to a further aspect of the invention there is provided a method to 
20 desatutate a fatty acid substrate comprising the steps of: 

i) providing a reaction vessel according to the invention; and 

ii) growing said cells contained in said reaction vessel under conditions 
which allow the desaturation of at least one fatty acid substrata. 



25 An embodiment of the invention will now be described by example only and with 
reference to the following tables and figures: 

Table 1 illustrates the composition o f major fatty acids in T. pseudonana; 



30 



Table % illustrates the major fatty acids of pYES and pYDESN yeast transformante 
with and without addition of exogenous saturated fetty acids; 
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Table 3 illustrates the A6 desaturase activity of TpDESl compared to that of an 
homologous Phaeodactylum tricarnutwn desaturase; 

5 Figure 1 illustrates the predicted protein sequences with homology to front-end 
desaturases derived from the T. pseudonana draft genome. Sequence alignments of 
12 putative T, pseudonana desaturase3 with other functionally characteriaed front-end 
desaturase enzymes identified three main blocks of homology that represent the 
functional domains of front-end acyl deBaturases (A). The darker shaded box 
10 highlights the cytochrome bS haem-bmding domain and shaded boxes indicate ihxee 
bistidine boxes. See Material and Methods for Genbank accession number and source 
species of the functionally characterised enzymes. A phylogenetic tree of nine T. 
pseudonana desaturases with other enzymes was constructed (B). By removing the 
regions containing gaps (ambiguous alignment region), a dataset was created from an 
15 alignment originally made with clustalX. The free was constructed from the dataset 
using Phylip3,5c software package and bootstrap analyses were carried out with 1 000 
replicates. Only well supported nodes (over 70%) are indicated with bootstrap values. 
All branches are drawn to scale as indicated by the scale bar (=0. 1 substitutions/site). 
TpDESN sequence is 477 amino acids long (C). The cytochrome b5 haem-bhulmg 
20 domain is on a shaded background and the three histidine-boxes are framed; 

Figure 2 illustrates RT-PCR expression analysis of TpdesN. Cells were harvested at 
different stages of growth for total ENA extraction and cDNA synthesis (A). PCR 
was performed on cDN A derived from reverse transcribed SNA using TpdesN and 
25 1 8s rSNA specific primer pairs <B). PCR was carried out on undiluted (lane 1) and 
five-fold serial dilutions (lane 2-4) of each cDNA. The IBS rRNA gene was used as a 
control of cDNA synthesis. EE: early exponential phase, LE: late exponential phase, 
ES: early stationary phase; 

30 Figure 3 illustrates GC analysis of FAMEb from yeast transformed with the empty 
plasmid pYES2 or the plasmid containing TpPESN. ftivscl yeast strain transformed 
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c - ■ 

with either pYES2 (A) or pYDESN (B) were induced for three days at 20°C without 
supplemeiatatioii before sapling for fatty acid a^lysis. I. S/ internal standard {17:0). 
The experiment was repeated three times and results of a representative experiment 
are shown; 

5 

Figure 4 illustrates mass spectra of DMDS FAME adducts from pYDESN 
transformed yeast. Mass spwtrum of the DMDS adduct of 16:IA9 FAME, present in 
all yeast samples (A). Mass spectrum of the DMDS adduct of 16:lAl 1 FAME, which 
was only fouad in yeast transformed with pYDESN O). Ficolinyl esters with spectra 
10 characteristic of 16\ lAl 1 were also identified hi these samples (data not shown); and 

Figure Sa is the genomic nucleic acid sequence of the desatutase A from F. 
p&eudtmtmai Figure 5b is the cDNA sequence desaturase A; Figure 5c amino acid 
sequence; 

15 „ ; . . *;. . ; ,. . 

Fxgurefia is ffiegenotbSc" nucleic acid sequence of desatura.se B from Thalassiasird 
pseudonanai Figure 6b is the partial amino acid sequence; 

Figure 7a is the nucleic acid sequence of desaturase E from Thalassiosira 
20 psuedonana; Figure 7b is the amino acid sequence; 

Figure 8a is the nucleic acid sequence of desatu*ase I from Thalassiosira 
pzeztdonana; Figure Sb is the cDNA sequences and Figure Sc is the amino acid 
sequence; 

25 

Figure 9a is the nucleic acid sequence of desaturase K from Thalassiosira 
pseudonana; Figure 9b is the amino acid sequence; 

. Figure:. 10a is the nucleic acid sequence of desaturase N from Uialassi&sira . 
30 pseudonana. 
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Figure lla is the nucleic acid sequence of desaturase O from Thalassiosira 
p$eudvnana\ Figure lib is the cDHA sequence; and Figure 11c is the amino acid 
sequence; 



5 Figure 12A and 12B is a GC analysis of FAMEs from yeast expressing TpDESI with 
exogenous substrates 18:249,12 (A) and 1 8;3A9,12 5 15 (B). New FAs produced from 
endogenous and exogenous substrates are underlined. 

Figure 13 is an illustration of fatty acid synthesis pathways. 

10 

Materials and Methods 

Identification of putative Thfllassfnsira nseudtijuaiia desat urase-coding 
sentiences and phvlogenetic analysis w ith other functionally characterised 
15 desatarases 

The draft genome of the diatom Z! pseudanana has been, sequenced to approximately 
nine times coverage by tihe whole genome shotgun method. Sequence data were 
produced by the US Department of Energy Joint Genome Institute 

20 flittD://www.igi.doo.govA and the raw sequence data were downloaded and installed 
on a local server. Batch tblastn searches were earned out using protein sequences of 
the following 13 known desaturases as query, including PEDES 1 (AY332747, 
Pavlova lutheri), TFAD4 (AF4895S9, Thraustochyirium sp. ATCC 21685), TFAD5 
(AF4S95S8, ThraustochytHum sp. ATCC 2l&%5% PtDELS (AY082392, 

25 Phaeodactylum trievrnutum)* FtDELtf (AY0S2393, Phaecdactylum tricomutuni), 
EgDELS (AF139720, Euglena gracilis), EgDEL4 (AY27855S, Euglena gracilis), 
ZfDEL (AF309556, Danio rerio\ B0DEL6 (U79010, Baraga officinalis)* HsDELS 
(AF0S455S, Homo sapiens\ HsDEL6 (AF0S4559, Homo sapiens), G&DEL6 
(AF031477, Caenorkabditis etegans) and. CeDELS (AF078796, Caenorkabditis 

30 etegans). 
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All nonnredundant sequences with, an E value less than 0*001 were retrieved and 
assembled into contigs wing the CA!P3 sequence assembly program [18]. The conttg& 
were faan$lated into amino sequences in three frames in the orientation indicated by 
tblastn result Putative desatuxase gene models were constructed manually based on 
5 sequence homology and in frame GT-AG intern boundaries were identified. 



Deduced amino acid sequences of all 12 putative desaturase sequences of 21 
pseudonana were aligned with the above 13 functionally characterised desaturases 
from other species, using ClustalX version 1.8 [19]* The alignment was then 
10 reconciled and further adjusted. Only nine near full-length Tha!as?ioszra sequences 
were retained for further analyses. 

A dataset of 250 conserved residue positions, was used for construction, of the 
phylogenetic tree. Distance analysis -used the progtam pratdist of the Phylip 3.5c 
15 package with a PAM250 substitution matrix and a tree was then built from the matrix 

~ ~ using fitch (Fitch-Margoliash method). Bootstrap analyses were earned out with 

1 000 replicates using the neighbour-joining algorithm. 

Cultivation of 31 nseudonana 

20 

H pzeudonana (CCAP 1 OS 5/1 2) "was obtained from the Culture Collection of Algae 
and Protozoa (Dunstaffiiage Marine Lab. a Oban, PA34 4 AD, Scotland, ILK,). The 
growth medium used was enriched artificial seawater medium (EASW), made up in 
20 1 batches as described previously [4]. The cultures were grown in one litre flasks 
25 at 15°C with 50 j.iE m" 2 s' 1 constant illumination, and aeration provided by shaking, 
the flasks at 150 rpm. 



30 



Cell density was monitored by counting cells with a haemocytometer. Nitrate 
concentration was determined periodically during the culture time by measuring the 
change of the medium absorbance at 220 ran [20], 
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RNA extraction* eDNA st>m€hesis and RT-PCR analysis 

5 Total RNA was extracted from ftozen cells harvested at different stages of growth 
with an KNeasy plant mini kit (Qiagea). First strand cDNA was syothesised from 
three fig of DNAse treated RNA using a Prcstsr First-strand RT-PCR kit 
(Stratagene). PGR was perfbiraed using undiluted and five-fold dilutions of cDNAs 
as followed: the reaction were heated to 95 °C for 5 min followed by 35 cycles at 95 

10 °C for 30 s ? 50 °C or 65 6 C (for ISSrRNA and TpdesNTZsptttivtiLy) for 30 s and 72 °C 
for 2 nun, then a single 72 *C for 10 min. As a marker for constitutive expression, the 
18S rRNA gene was amplified with the primer TH18S5 1 (5'- 
GGTAACGAATTGTTAG-3') and TH18S3 1 (5'-GTCGGCATAGTTTATG-3')« 
TpdesN eDNA was amplified using primers DESNR2 (5'- 

15 OTGAGAGCACTAACCAAGCTT-3 ? ) and DESN2 (5*- 

CAATCAGTAGGCTTCGTC G-3 5 ). Aliquots of PGR reaction were dectrophoresed 
through a 1% agarose gel. Identity of the diagnostic fragment amplified with TpdesN 
specific primers was verified by sequencing after cloning in the pGBM-T EasyVector 
(Promega). 

20 

Functional characterisation of TpDESI in yeast 

The satire TpdesI codfcjg region was amplified from T. pseudanana cDNA 
with primers DwJNB 5 y - GCG GGATCCA CCATGgCrGGAAAAGGAGGAGAC- 

25 ' 3' (ORF start codon is indicated by bold type; undefined sequence is a BamHl site; 
italic sequence is an added alanine codon ? not present in the original sequence of 
Pldesl) and DesICE 5>~GC GAATTC TTAC ATGGf!AGGGAAATC-3 8 (ORF stop 
codon is indicated in bold type; underlined sequence is a EcoBl site). The Expand 
High Fidelity PGR. system (TRoche) was employed to minimise potential PGR errors. 

30 The amplified product was gel purified, restricted and cloned into the corresponding 
sites behind the galactose-inducible GAL1 promoter of pYES2 (Invitrogon) to yield 
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the plasmid pYDESl. This vector was transformed into S. c&revisiae sixain lavscl 
(InvitrogeG.) by a lithium acetate method, ami transformaafe were selected cm 
minimal medium plates lacking uracil. 

For functional expression^ cultures were gro*wn at 25°C in the presence of 2% 
5 (w/v) raflBnose and 1% (w/v) Teargitol NP-40 (Sigma), Expression of the transgene 
was induced when ODeocmm reached 0.2-0.3 by supplementing galactose to 2% (Wv). 
At that time!! the appropriate fatty acids ware added to a final concentration of 50 fiM- 
Itiaubatipn was carried out at 25 °C for three days. 

10 Fangtionat characterisation of TpPESN in veast 

Genomic DNA fittra T. pseudonana cells was extracted using the DNA isolation kit 
. Puregene (Gentra Systems) and 1 00 ng was used to amplify the entire TpdesN coding 
region with primers > DesNNB 5 V 

15 GC GGGATCCA CCATGgC7GACTTTCTCTCCGGC-3 ' (OEP start codon is 
indicated by Jfcfld type- underlined sequence is a BamHl site; italic sequence is an~ 
added alamo.? codon, not present in the original sequence of TpdesN) and DesNCE 
5'-GCGAATTCTCAATCAGTAGGCTTCGT-3* (ORP stop codon is indicated in 
bold type; underlined sequence is a BcoRL site). The Expand High Fidelity PGR 

20 system (Roche) was employed to minimise potential PGR errors. The amplified 
product was gsl purified, restricted with EcoRl and Sa?nRl and cloned into the 
corresponding sites behind the galactose-inducible GAL1 promoter of pYES2 
(Thvifrogen) to yield the plasmid pYDESN. The fidelity of the cloned PCR product 
was checked by sequOTcing. The vector 5. 

25 cerevisiae strain Ihvscl (Iiivitrogen) by a lithtuta acetate method, and transfonnants 
were selected on minimal medium pistes lacking uracil. 

For the feeding experiment with PUFAs, cultures were grown at 22*C in the presence 
of 2% (w/v) raf&fliose and 1% (w/v) Tergitol NP-40 (Sigma), Expression of tJie 
30 transgeoie was induced when ODemnm reached 0.2-0.3 by supplementing galactose; to 
2% (w/v). At that time ? the appropriate fatty acids were added to a final concentration 
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of 50 pjVL Incubation -was carried out at 22°C for three days and then 15°C for 
another three days. For the feeding experiment with saturated fatty acids, a single 
Itwscl colony transformed with pYESZ (empty plasmid, control) or pYDESN was 
inoculated in 10 ml of minimal media njimis uracil containing 2% raffinose and 
5 gsown overnight at 30*C with shaking (300 rpm). After 16-24 h, cells were collected 
by spinning at 4500 rpm for 10 min. After discarding the supernatant, the cell pellet 
was resusp ended in the same medium mentioned above supplemented with 2% 
galactose and t% tergitol ? to obtain a cell density of 5x1 0 7 cells/mL Fifteen ml of (his 
cell suspension were added to a 100 inl-flask -with or without addition of saturated 
10 fatty acids (as mentioned in the text) at 500 jiM final concentration. Desaturase 
induction was then carried out at 20°C with shaking (300 rpm) for three days. 

Fattv acid analysis 

15 Microalgae or yeast cells were harvested by centrifugatioa Total fatty acids were 
extracted and transmethylated as previously described [4]. Most FAMEs were 
identified by comparison of retention times to a 37 FAME mix (Supelco). FUFA 
FAMEs were also identified by comparison to a sample of standard Menhaden oil 
(Supelco) tjrajismethylated as per the samples. 

20 

Ditnethyl disuiphide ODMDS) adducte were used to determine the double "bond 
position in identified and unidentified moiiouiisaturated FAMEs, These were made 
by adding together 50 ]xl DMDS (Aldrich), 100-1000 rag FAMEs dissolved in 50 fil 
hexane, and 5 pi 50 mg ml" 1 iodine in diethyl ether, This solution was heated at 40°-C 

25 for 15 h. and partitioned with 200 fil hexane and 100 pel 5% (w/v) sodium 
thiosulphate. The hexane phase was removed, dried under vacuum, reconstituted m 
50 pi fresh hexane and used for GC-MS analysis, A Trace GC 2000 (ThermoQuest) 
fitted with a30mx 0.25 mm x Q.5 \xm film thickness ZB-1 column (Phenomenex) 
was used to ehromatagraph 2 jxl DMPS adducts injected at 250*C and a 50:1 split 

30 . ratio with He as carrier gas at 0.6 ml min" 1 in constant flow mode, The oven program 
was 120°C for Imin then to 340°C at 5°C mixcK Mass spectra were obtained using a 
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GCQ (ThetmoQuest) mass spectrometer operating in full scan mode over 50-500 
m/z. Picolinyl esters were also made from FAMEs to confirm their identities. These 
were obtained by adding 15 ixl fireshly prepared 2;1 (v/v) 3-(hydro^ymethyI)-pyri 
(Aidrich): potassium tert butoxide I M solution in tetrahydrofiit^ (Aldrich) to 50 fil 
5 FAMEs dissolved in diehloromethane* This solution was heated at 40°C for 30 mm 
and partitioned with 200 pi bexa^e and 100 \xl 2.5% (w/v) sodium hydrogen 
carbonate. The hexsne phase was removed, dried under vacuum ap.d reconstituted in 
50 \il fresh hexane. Picolinyl esters were injected and separated by GC-MS using the 
same conditions as for DMDS adducts; Sperling P., Zahringer U, and Heinz E. 
10 (199S) A sphingolipid desaturase from higher plants, J. EioL Chem. 273, 2&590- 

2S596; Sperling F r , Libiseh B., Zahringer U., Napier J.A. and Heinz E. (2001) 
Functional identification of a DS-sphfogolipid desataass torn Borago officinalis. 
AxoK Biochem. Biophys. 38S 5 293-29S; Whitney H.M., Michaelson, LJV„ 

Sayaaoya, 0. ? Pickett J. A. and Napier, LA. (2003) Functional characterization of two 
15 two cytochrome b5-fusion desaturases &om Anemone leveilfei; Hie unexpected 

Identification of a fatty acid AS^desaturase- Plsjnta 217, 983-992; each of which are 
incorporated by reference. 

EXAMPLE 1 . 

20 Identification and phylogenetic analysis of putatfve T. pseudonana desaturase 
sequences with other functionally characterised desaturases 

Tblastn searches with 13 ftmctionalLy characterised desaturases revealed 427 non- 
redundaat raw sequences with E values less than 0.001. Twelve unique contigs were 
— * assembled after -re£^ constructed : ~ - 

25 manually based on sequence homology, These 12 gene contigs were arbitrarily 
designated Tpd&sA to TpdesL. All 12 showed significant sequence similarity to query 
sequences with 9 containing near full length open reading frames compared to other 
known desaturases (Fig. 1 A). Inl^estingly, the predicted amino acid sequence of all 
nine J. pseudonana desaturases have a characteristic fused cytochrome b5 haem- 
30 Hndittg domain (HP[G/A]G) at their Kf-tqnninus and three histidme boxes (H[XI3- 
4H, H[XJ2~3HH AND Q[X]2-3HH) with the replacement of the first histidme by 
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^utanrine in the Hwrd histidine box in all but two of the pr&dioted proteins (TpDESA 
and T^DESB). These are common characteristics of a large subgroup of front-end 
acyl group desatarases [21], These Mstidine-box motifs are critical for desaturase 
activity, most likely because they serve to coordinate Hie diiron-oxo component of 
5 the active site. Three remaining sequences (TpDESD, TpDESL $nd TpDESH) appear 
to be partial, covering only the C-tettninal end of desaturases, but nevertheless they 
do contain a typical third Mstidine box of the above mentioned subgroup of 
desaturases (Fig- 1 A). 

10 In order to gain insight into the relationships of these T, pseudonana sequences to 
other fimctionally characterised desatiocases arid especially algal desatarases, we 
constructed an unrooted phylogenetib tree using a Fitch-Margoliash method with 
statistical confidence measured by bootstrap analysis (Fig. IB). Relationships of four 
putative T. pseudonana desaturases are hi well supported (>70% bootstrap value) 

15 subgroups with at least one functionally cbaracferised desaturase from other species. 
Both TpDESM and TpDESO gtottped with FtDELS, a A5~desaturase from another 
diatom, P. tricornutum [5] 5 suggesting these two enzymes may also have a A5- 
desattjrase activity. Similarly TpDESK is grouped with two A4-de$aturases TFAD4 
and EgDEL4 from Thrausiochyttium sp, ATCC216S5 [22] and E. gracilis 

20 respectively, TpDESI grouped with PtDELS, a A6-desatease from P. tricornutum. 
This indicates that TpDESK and TpDESI may have A4 and A6-desaturase activities 
respectively. However, as easymes with different regioselectivitiss are also found hi 
a well supported subgroup (EgDELS, CeDEL5 and CeDEL6; AS, A5 and A6- 
desatoas© respectively) and regioselectivity may sveia derive rt^dqpenuently after a 

25 more recent duplication (CeDEL5 and CeDEL6) [23] predictions based ox* homology 
can be misleading and it is essential to fimctionally characterise each enzyme. 

The regaining five T! p$&udonana sequences fall into three separate subgroups 
(TpDESE; TpDESA and TpDESB; TpDESG andTpDESN) which do not group with 
30 any other known functional desatarases with high confidence. It is therefore possible 
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that these proteins exhibit novel regioselectivity. The content study focussed on the 
characterisation of one of these proteins, TpDESN. - 

EXAMPLE 2 

5 

Temporal expression of TpDESN gene 

RTVPCR analysis of Tpd&sN transcript was conducted at different stages of algal 
growth, in order to establish if and when this gene is expressed. After RNA extraction 

10 and cDNA synthesis, TpdesN specific PGR products -were amplified. PGR 
amplification of the 1 SS xDNA gene was perfbmied as a control for the quantity of 
cDNA used dining PGR reactions, Figure 2 shows that fee diagnostic 519 bp cDNA 
amplification product expected for TpdssN was present: at similar level at the 
different stages of cultivation of the microalga cells, Thus, Tpd&sN is 

15 tamscriptioiiaUy active at a constitutive level during Thalassiosira growth, 
suggesting that it may encode a desaturase with a housekeeping function. 

EXAMPLE 3 

20 Functional characterisation of TpDESN in yeast 

The putative desaturase sequence annotated TpdesN was contained on a genomic 
DNA contig of 2580 bp on which no introns was detected. To establish the function 
of the protein encoded by this gene a the full-length sequence was amplified from 

25 genomic DNA. An alanine codon containing a G as the first letter was added 
immediately downstream of the start codon of Tpd®$N to ensure optimal translation 
iik yeast [24]. The Tpde&N ORF is 1434 bp Jons and encodes a 477 amino soid 
protein TpDESN (Fig. 1C), having a molecular weight of 53.8 kDa. Analysis of the 
secondary structure of TpDESN using SOSUI software 

30 <htlpi//sasm.protecm [25] predicted four 

ifxansmembrane region (not shown). Alignment of TpDESN with functionally 
characterised desaturase sequences mentioned above indicated m overall identity of 
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25% ? with fhe cytochrome &J-like domain and the three conserved histiditie-rich 
motif areas showing greatest homology, 

The primary sequence of TpDESN exhibited typical features of front-end desatuxases 
5 involved in PUFA synthesis. In order to characterise the specificity of this protein, 
PUFAs (18:2A9 S 12; 20i2Al!,14; 20:3A8»1U4; 22t4A5 p 8,H,14; 1S:3A9 ? 12,15; 
20:3AH,14 5 17; 20:4A8 ? 11,14,17; 22:547.10,13,16,19) where first fed to the host 
yeast transformed with pYDESN and the vector alone (pYES2) as a control 
Unexpectedly, after six days of incubation, TpDESN did not desaturate any of fhe 

10 supplemented PUFA substrate, Fur£hermore ? there did not appear to be any 
production of 1S:2A9,12 from endogenous 18:1A9- However, a significant increase 
was observed for a peak elating in the range of sixteen carbon monounsatoated 
FAMEs in the yeast transformed with pYDESN (Fig. 3), The position of the double 
bond in this product was determined by GC-MS analysis of FAMEs derived to 

15 DMDS adducts [26] and picolinyl esters, The DMDS adduct of 16.1A9 FAME 
yielded two major fragments at m/z 145 and 2,17 (Fig. 4A). Fragmentation of the 
increased FAME peak found in unfed or fed yeast transfoxmed with pYDESN 
produced two diagnostic fragments at mfz 117 and 245 (Fig, 4B). This fragmentation 
pattern was indicative of an All monot^satuxated sixteen carbon FAME, 16:1A1 1 3 

20 suggesting that TpDESN encoded a new Al 1-desaturase. Small amounts of this FA 
hive also been measured in Thalassiosira cells (Table 1)- To further substantiate 
these results, yeast transformed with pYDESN and the control empty vector* pYES2, 
were cultivated in medium supplemented with saturated FA (14:0; 16:0; 18:0) 
representing potential substrates fot the synthesis of the monouasaturated product. 

25 Yeast fatty acid profiles were analysed after three days of incubation at 20°C. Results 
in Table 2 showed that a small amount of 1 6: 1 Al 1 (0.23% of total FAs) was detected 
in yeast transformed with. pYES2 ? suggesting endogenous synthesis of this FA from 
16:0, This FA accumulated at a higher level in both types of transformed yeast after 
feeding with 14:0, with values up to 5.84% in pYDESN transformants. A possible 

30 explanation for this increase m the pYES2 transfoimants is that the endogenous yeast 
A9-desaturase was able to use additional 14:0 to produce 14:1A9 that was 
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subsequently elongated to 16:1 AIL Moreover, it has been reported that wild type 
yeast cells cultivated in media supplemented with 14:1 AS> synftesised 16:1 AH by 
Elolp-dependent carboxy tenranal elongation [27]. After 18:0 supplementation, the 
percentage of 16:1 All, of about 6% total FAs, was similar to that observed after 
5 feeding with 15:0. Presence of extra 18:0 could lead to an inhibition of the 16:0 
Chain elongation system, which might allow more 16:0 to be available for All- 
desaturation. On the other hand, 18:1 All represents 1.2% of the total FAs in 
transgenic yeast. No variation in its proportion was monitored under the different 
conditions of incubation, even after supplementation -with 18:0 in pYDESN 
10 transfomante. This suggests that this FA originates from elongation of 1 5:1 A9 rather 
than Al 1-desaturation of 1 S:Q. 

EXAMPLE 4 

1 5 Functional characterisation of TpDESI in yeast 

.To -e^blishjQie fhtjction of TpDESI, the MMength cDNA was expressed in the 
yeast Invscl under the control of an inducible galactose protaoter. Potential 
substrates of firant-end desaturases (1S:2A9,12; 1S:3A9 ? 12 3 15; 20:3AS 3 ll,14; 

20 20:4AS ? 11 ? !4 ? !7; 22:4A7,I0,13,16; 22:5A7 a l 0,13,1$,! 9) were tested. Figure 12A and 
12B show that after supplementation of the medium with 18:2A 9,12 and 18:3 A* 12 * 15 
respectively;, and after three days of incubation, yeast cells containing pYDESI had 
extra fatty acids. Extra peaks observed when cells were fed with 18:2A9 3 12 had a 
retention time identical to 16:2AS P 9, 18:2A6 3 12 and 18:3A6,9,12 (Figure 12A). Extra 

25 \ peaks observed whisnjcells were fed with 18i3A9 5 12 3 IS had a retention time identical . 
to 16;2A6 ? 9 ? 1S:2A&,12 and 18^6,9,12,15 (Figure 12B), These results demonstrate 
that TpdssI encodes a Afi-desaturase which can introduce double bond in 
exoganously fed 1S:2A9,12 and 1S:3A9,12 3 15 fatty acids, but also in endogenous 
16:1A9 and 18:1A9 fetty acids. Percentages of conversion of these different 

30 substrates are given in Table 3, 
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Fatty acid profiling of marine micraalgae had shown that 3! pseudonana represents a 
good candidate to discover genes involved in. the production and storage of PUFAs 
[4]. Analysis of the recently completed draft genome of this microalga revealed the 
presence of many candidate genes for elongase and desaturase activities most 
5 probably involved in. catalysing different steps of the PUFA hiosynthetic process. We 
have identified 12 possible desaturase genes, 9 of which there is sufficient sequence 
information to demonstrate that they exhibit typical features of front-end desaturases 3 
Le. a cytochrome h5 domain in the N-terminus and three histidxne clusters located at 
highly conserved regions. Phylogeuetic analysis revealed that several of the genes are 
10 closely related to a number of previously characterised fnraJb-ead desaturases 
involved in PWA synthesis. However, the current work highlights the feet that 
desaturase function, in terms of regl ^selectivity, cannot solely be based on prediction 
from primary amino acid sequence homology, 

15 The fatty acid profile of T* pseudonana cells is quite diverse (Table l\ with the 
health beneficial: EPA (20:5*5,8,1144,17)- and DHA (22:6A4,7,10 P 13 ,16,19) 
accounting for a large proportion. However, the number of desaturase gene sequences 
found in the genome was higher than we expected based on the number of different 
desaturation reactions required to produce fee diversity of FA in this microalga. This 

20 suggested that non-obvious de-saturation reactions might also occur in the 
Tkalas&iossira cells. As a first step to establishing Amotion of the many putative 
desaturase sequences, we focused on the TpdesN contig due to the fact that the 
sequence was &ll-lmgth and intronle&s. A temporal expression study showed that 
TpdesN was constitutively transcribed during algal cultivation. Egression of the 

25 TpdesN GRF in yeast supplemented with. PUFAs as potential substrates for 
desaturation revealed no new products. These was also no evidence of activity with 
the endogenous 18:1A9 which excludes the possibility that TpDESN acts as a A12- 
desaturase. However, an increase in, the peak area of a FAME eluting in the range of 
the sixteen carbon FAMBs was identified and GC-MS based analysis revealed this to 

30 be 16:1 A3 1 fatty acid. Small amounts of this FA axe also pteseirt in wild type yeast 
However, quantitative comparison of FA levels in the empty vector pYES2 and 
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pYDESN tamsfoiroants showed that proportions of 16:1 All increased in the 
presence of Tp4&Nm both unfed cells and cells that had been fed different saturated 
FA. No other changes in either peak area or new peaks were detected in pYDESN 
traasformants, indicating that TpDESN i$ specifically involved in conversion of 16:0 
5 tol6:lMl. 

The presence of small amounts of 16:1 All have previously been reported in many 
microalgae, including T, pseudonnna. However, a function for this FA in algal cells 
has not been established, The low quantity* observed in many marine micro algae 

10 suggests that it may act as an intermediate in an as yet unidentified biosynthetic 
pathway, in insect cells, 3 6; 1 All represents an important precursor for pheromone 
synthesis, where it is produced by an acyl-CoA All-desatuxase. Interestingly, the 
insect Al l-desaturases do not possess a cytochrome b5 domain m their N-temiinal 
region. This represents a major primary structure difference compared with TfcDESN. 

15 The cytochrome b5 domain is not a determinant of the substrate specificity [23]. 

• : . 1 „. . Ahgnmexit of the desaturase domain of TpDESN* with the. full sequence of insect 
All-desaturases showed an identity of 20% (data not shown). In insect cells, All- 
desaturases are more or less specific depending on the origin of the sequence and 
wellHfccinnented reviews exist on this subject [14,15]. 

20 

In conclusion therefore, although the TpDESN primary sequence is very similar to 
front-end desaturases. it should not be considered a member of this family of 
desaturases because it acts only on 16:0. Identification of such a novel enzyme 
expands the fimotional repertoire, of the^tn^mbrane-bpund desaturases and it should. . _ . . _ 

25 provide useful comparative information for undemanding phyloge&ietic relationships 
between these enzymes. One question that remains to be answered regards whether 
cytochrome bS was independently fused to desaturases that had already acquired their 
different specificities, or whether an ancestral fusion protein for proximal lipid 
modification duplicated and subsequently evolved into different desaturases. Studies 

30 of the primary structure of the different PUFA desaturases support the fact that 
enzyme conversion (i.e. change of specificity) can be achieved through a relatively 
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few structural changes [29]. The high degree of homology between the many 
potential front-end desatura$es identified In the genome of T- ps&uionana support 
this notion. Given the FA profile of Z pseudonana cells and the complexity of the 
desaturase gene family it is likely that different genes will encode A4 3 A5 and A6 
5 desaturases. It will now be very interesting to fictionally characterise these 
remaining putative desataase genes and study the relationship between 
regioselectivityj primary amino acid sequence and phylogenefcic relationship. A 
crystal structure for these enzymes is still not available due to technical difficulties in 
obtaining sufficient quantities of purified membrane-bound protein. Molecular 

10 genetic approaches involving site-directed mutagenesis have provided new insight 
into structure-function retetioflships, including for example that residues in close 
ptotfircrity to the histidine motifs have been found to be involved in shifting the ratio 
of desatumtioivTbiyclroxylation activities [30], Detailed comparative analyses and 
computer modeling of these diverse desaturases from T. pseud<?n&na may further 

15 guide site-directed mutagenesis studies aimed at defining 'key residues controlling 
substrate specificity and regioselectivity of the introduced double bond. 
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Claims 



1- A transgenic cell comprising a nucleic acid molecule which comprises a 
nucleic acid sequence which nucleic acid molecule consists of the sequences 
5 represented in Figures 5a, 5b, 6a, 7a, 8a, Sb, 9a a 10a, 11a, lib, or m*cleic acid 
moleoules which hybridise to these sequences, wherein said nucleic acid molecules 
encode a polypeptide which has desaturase activity. 

IS 

2. A cell according to Claim 1 wherein said hybridisation conditions are 
10 stringent hybridisation conditions. 

3. A cell according to Claim 1 or 2 whereto, said nucleic acid molecule 
comprises the nucleic acid sequence as represented in Figures 5a, 5b ? 6a ? 7a 7 8a, 8b, 
9a, 10a, 11a, lib. 

15 

. 1. ; _ "4/ _ m A cell according to CMm 3 wherein said nucleic acidmolecule consists of 

the nucleic acid sequence as represented in Figures 5a, 5b, 6a, 7a, Sa, 3b, 9a, 10a, 
11a, lib. 

20 5. A cell according to any of Claims 1-4 wherein said cell over-expresses said 
desaturase encoded by said nucleic acid molecule, 

6. A cell according to any of Claims 1-5 wherein, said transgenic cell is 
m fransfected with a nucleic acid molecule comprising a nucleic acid sequence as 
25 represented by Figure 10a and which encodes a desaturase polypeptide wherein said 
polypeptide has All -desaturase activity, or a nucfeic acid molecule which hybridises 
to the nucleic acid molecule in Figure 10a and encodes a polypeptide with All- 
desaturase activity. 
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7. A cell according to any of Claims 1 ^5 wherein said transgenic cell is 
frajisfscted with a nucleic acid molecule comprising a nucleic acid sequence as 
represented by Figure 8a and which encodes a desaturase polypeptide wherein said 
polypeptide has A6-desaturase activity, or a nucleic acid molecule which hybridises 
to the nucleic acid molecule in Figure 8a and encodes a polypeptide with A6- 
desaturase activity. 

8. A cell according to any of CHms 1-7 wherein said transgenic cell is a 
eukatyotic cell. 



9. A cell according to any of Claims 1-7 wherein said cell is a prokaryotie cell. 

10. A cell according to Claim S wherein said et^aryotic cell is a plant cell. 
15 11. A plant comprising a cell according to Claim 8. 

12. A seed comprising a cell according to Claim 9. 

13. A plant or seed accordiqg to Claim 1 1 or 12 wherein said plant or seed is an 
20 oil seed plant. 

14. A reaction vessel comprising at least one polypeptide encoded by a nucleic 
acid molecule wherein said nucleic acid molecule comprises a nucleic acid sequence 
consisting of the sequences as represented in Figures 5a 3 Sb ? 6a ? 7a, 8a, Sh s 9a, 10a s 

25 lla 5 lib, or nucledc acid molecules which hybridise to these sequences, and which 
encode a polypeptide which has desaturase activity; fatty acid substrates and co-. 
factors wherein said vessel is adapted for the desaturation of said fatly acids 
substrates. 

30 15. A vessel according to Claim 14 wherein said polypeptide is expressed hy a 
cell according to the invention. ----- 
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16. A ve$$6l according to Claim 1 5 wherein said cell is a yeast cell 

17. A vessel according to Claim 15 -wherein said cell is a profcaryotic cell. 

18. A method to desaturate a Fatty acid substrate comprising the steps of: 

i) providing a reaction vessel according to any of Claims 1 4-1 7; and 

ii) growing said cells contained in said reaction vessel under conditions 
which alkrtv the desatuiation of at least one fatty acid substrate. 



15 
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ABSTRACT 

We describe transgenic cells transformed with nucleic acid molecules which encode 
enzymes with desaturase activity and ths use of these cells and enzymes in 
5 biocatalysis. 



10 



27 



-mumm 



>?03;.llil- ■ 



17. FEB. 2804 16:12 HARRISON GODDARD 



NO. 017 P. 33 




Table 1 

-''Composition of tike major fotty zcids in T. pseudonana 
~ ~~~ " Fatty acid ' ~ _ Mol % of total fatty acids 



14:0 


11.50 


16:0 


17.95 


!6:1A9 


19.81 


16:1A11 


"•" 0,19 


16:2A9,12 


:) 2,47 


16:3A6,9,12 




18:0 


6.47 


18:1A7 




18:1A9 


1.3U 


1S:1A11 




1S:2A9 3 12 


2.37 


1S:3A6,9 5 12 


0,98 


. ' lS:3A9 a 12,I5_ ; " 




18:4A6,9,I2,15 


5.72 


20:0 


0.44 


20:3AS = il,14 


0.26 


20:4A5 S S,11 ? 14 


2.46 


20:5A5 5 S,11,14,17 


17,51 


22:6A4,7,10,13 a ie,19 


6.64 


24:0 


0.49 



Microalgai cells were harvested during the exponential phase of growth as described 
in section 2. Fatty acid were extracted and converted to their methyl esters before 
analysis by GC. Each peak was identified by the retention time compared to known 
standards. Each value represents the average of triplicate measurements within a 
representative culture. 



# 
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Table 2 

Composition of the major fatty acids of pYES2 and pYDESN yeast transfotmaints -with and without 
addition of e&ogenaus saturated fatty acids 



Fatty acid Mol % of total fatty acids 

- substrate - .. + 14:0 • +16:0 + 18:0 





pYES2 


pYDESN 


pYES2 


pYDESN 


pYES2 


pYDESN 


pYES2 


pYDESN 


14:0 


0.7S 


0.52 


1.12 


0.96 


0.76 


0.58 


0.76 


0.70 


14:1A9 


0.22 


0.13 


1.20 


1.29 


0.23 


0.14 


0.23 


0.20 


16:0 


18.40 


15.04 


18.37 


14.62 


23.85 


22.09 


17.22' 


15.09 


16:1A9 


39.73 


35.55" 


43.39 


36.24 


42.24 


37.03 


36.24 


31.67 


16:1A11 


0.23 


3.27 


2.36 


5.84 


0.22 


5.57 


0.25 ." 


5.84 


18:0 


7.37 


7.34 


6.61 


- 7.23 


6.36 


6,60 


16.72 


17.47 


18:1 A9 


30.19 


34.32 


24.44 


30.24 


23.89 


25.19 


26.07 


26.58 


18:1A11 


1.20 


1.21 


1.35 


1.30 


1,19 


1.12 


1.0S 


0.96 


26:0 


1.S9 


2.63 


1.16 


2.29 


1.26 


1.70 


1.43 


1.50 



The values .given-are expressed as mol % of total fatty acid methyl esters quantified by QC. All values are "_. 
the meatus of duplicates from a typical experiment 
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TABLE 3 






% conveEsiea 


substrate 


product 


Thai 


ifsfosim pssudonana 


Phaeadactylum triconiutum 


16:149 






14 


6 


lfclitt 


1E:2A6£ 




IS 


5 


1S-.2A9.12 


1S:3A6 S 9,12 




54 


23 


1S:3A9,12,I5 


. 18:4A6,9,12,15 




68 


27 



lfci A9 - l€:ln-7 16:2A<5,9 - 162&-7 18 



!;1A9 = lS:ln«9 18:2A6 a 9 = 18:2n-$ 

18;3A9 P 12 S 15 = 183a-3 18=4^9,12,15 - 18:4n-3 



Both algae are Diatoms. TpDESI and AS-des* turase sequences are 70% identical at the amino acid 
level. Data for the Phaeodaczyltem enzyme ha ire be«n taken, frqm Up following paper (Table 1): 
£)omsrgtt& s P- s LerbW a J M Zaeliriager, U/aad Heinz, E. 
Cloning and fbnctkmal Dharacfcerizatioa of ieodactylum tricomutum ftont-iKnd desaftirases involved 
in eicosapeiitaenuoic acid biosynthesis _ 
Eur. J. Biodum 269 (16), 4105-4113 (2002) 



17. FEB. 2004 16:13 



HARRISON GODDARDl 

Figure I 



NO. 017 P. 36 




17. FEB. 2004 16: 13 



HARRISON GODDARD' 



NO. 017 P. 37 



Figure 2 



A 


Growth phase 


early exponential: EE 


late e^cponeritial; EE 


early stationary: ES 




Incubation time 


142 h 


237 h 


311 h 




Nitrate degraded 


20% 


60% 


100% 



B 



JSStSNA- 
(675 bp) 



TpdesN 
(319 bp) 




■ " 10095.865 :AJ- Fgb-04:. 03 : : if : =| -■ 




I 
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Figure 3 



A 



DYES2 



16:1A9 



16:0 



18:0 



LS. 



ft f » 



u 



i t, 

B 



til 
W 

§ 

2 

9-1 

S 

u 

w 

Q 



dYPESN 



16:1A9 



1&1A9 



16:0 



16:1A11 

18:0 



*J\. 



ill 



Retention time 



Retention lime 
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Figure 4 
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gIGPSS 5a 

COQ3W3SQC!QC!RCXCSGi?Vffl 

CG&A^TGCGAGC&CC^ 
CCCACATOTGGft^GGT^^^ 

TCCMTAGCTCAAGTTGTATGAACkACRAAACG 

GATGATGCCATGAGAGCTGTTTGG^^ 

03TTGTGCGTTMTGTTAGG 

GCGATAC C3^C AAAATCTTC^A3^^CA^GCCAATATCTGAT C AAAAAGCAGC TGTGACA 
. iTCCOTCSOTACCTGCBCTGMaB^ 
fflCRACTGGTAC^CaT^ 

gtocmtatctggksm^ 

CZV^TCaTGAAATATCGCaM 
ACSAGCCGCC^TCAAGAGG^ 

GATATCGCTTCTCCAGAAATGCTTCC! d AAGGC AACSCAGTTTGACCTG&AGT C ATTTGAG 
AAOTACTTCATTOACTT^^ 

ACATGGCTACTCTGGA^ClACSi.GCGGTAC'rSATTAGTATGATCGCGTXATCTGTCATCTCA 

ATCAA^TSCTACCACCA^^ 

CMCftAaOTGGATTTC^ 

CTGAATGACATTTT<^TTGGATCTA*G^ 

TGGAGAGAG<^GC^TAGAGAACATCATO^ 

TTCAAAGATCCCCAGOTOTOT 

3?C GTTG CTCACACATTCG ATXTTATT CATTC&CTC AC AG ATG£^AGAGGAGGT CTGGATA 
CAGAACZU^GAAGTTCATTCCGT^ 

cagcacattctgttccttc6gatgatcttt?mcgttggccgcgttggtattgtcgtagat 

TeTAC&CTOACTGZ^eeAAGTTCCOTM 

aaggz^ctoctgattto^ 

GCTsS 

CTGCGTTTGGCAGATACTAAGCACT 

CTACGGATCT^ 

CTTTCACATTACA^^ 
TATCAATC^TACTCATAGCTCCA 1 ^ 

OTQTOTATTGAGATTCW 

GTGAGGAAAGACTTCGC^ACGAAAGGCCACGGGAGTGTGGCTTTCATGTACACG^^TCT 

tatg&tgtagcaaccgcagct^ 
acgtogatctactct^ 

atgttgctgttg^tgcgggtgctgctgaaccccctcgccgcgggataatggcgaagggc^ 
gtcggacmtxgataatgttcttcacagagttta'tgagctgggtgtttgtaccaatacct 

ccttoatatggeact^ 

gaccgataggtggcoa^ggcc 

actca^gaaaccocgatggtcgaggtattaaaac 
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glgmra 5b 

Pull length cDKA sequences. OCT (Alanine) codoa add by us to bfettm 
yeast expression 

■B^m^CrGTGAGASAGAAGSC&CGA^ 
mCCTC<KGGC»GATAflCW^CT^^ 

CXA^fTCTSGSi^ACaGCGGTXCEG&TrilGrATCJiTCGTOXrATCMrCM^O^T^^ 

S^TraGGGTTSGMCTATGGCRCTGTCTTCrrGGGTCTC^ATGGCGCraGGT<^« 
^GAGCAtWaGAACATCATO^^ ■ 

atcottggccgcgotggtattgt^^ 

Sgt^tax^^gtacatcatcggcxctctttggcUgctaw^ot^^ 

TTCTGCGTTTGGCAGATACTAAGCACTCAAGACTrTGCRTGTCCAOGTTGGTCTOGGTtSS 



A^G^nC^iMGAAAGGCCACGGGAGTGrGGCTrrCAXGI^CG^ 



■axe 5 c 



amino acid sequence deduced from c»MA. se<gruence 

HUHMUW ALQLGTLVGC »TOSWJDhC GDBSIaUHQSM BTKSHKAKPI ffl^W«9 
OTCAVKHKAR KDGXiVLXiDGKT WJCHVEKFVHH HTCGVm-LEQ .* WMMJTW HWTO 
OTMUBHW* KTOEVCLDHM DDFTTOWSIDI A3PEMLBKPT 

naraBnm wSrnw HanrawMfl iialhvismk vlwmkm auua*tiHB 

i^A^S DIlfiWIVGTV MWaTO EEHREKHATTIi EreYDDEBGFK 

n=S mnanSwa heusw»» attuwiw iwrtoiyvb stoieekfrp 

SSS LSlLSQT SKPIFVYIIG SLWQAXMHQ I1I1GHHSVKBW HlW 

ycvncii-sxo dfscfrwsrw xoaauinKa hhufkuw mma ei^bkhsbpp 

IEIAFIDCW GMVNMFHETO KDFATKGHGS VAFWST' 
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Figura 6a 



NAITCCAn^rGCGGG&&T^^ 

asowv&ctcctatc 

atatgcgagactcgttcgt ctsgacc toc^g^^j^atgactggtgaggcca 

aattagttggg^tgcgtgca^^ 

ggcgtcgtcaagaggagatccatsttc^ 

gcgtgcatgatgtgtgcgtgcgtgtacgatgttoataggtagaaagagat 
cgaggcggtq^ttc?^ctattcaggatactgioy^gagttgatatagcagc ; 
agtaata¥ktcctagttgtu?tgt^ 
aatg&cgcaacaataacgt^sgtag^cta^ 
caaaggcttttc&taa^ 

G.^GCC^CTCTGCACCATTCCAGCGra 

AEC&CGACACACCGACCCGAGCCG^^ 

CCGAGCeGATACAGCCGAl^GGC3&A^ 

CGGCAACATCAAMCATGCC^CCCTC^ 

CTTCGTCTCGCCCGCATCC^ 

CCCGCCGCAAACCeCTOkAACl^ 

aacactccc^ccgkmcatggctca^^ 
ttccraatggatatcgcaccatc^ 

ttgccggtatggatgctaccga^gaattgaggdc^txtcatgatgattgg 
gttttggaggagaagttccctcattttgtgattggggaggtggattggac 

- - '- tactacc^cggggc^gagaatacto^ 

- agct^atcaaggatttci^ag)^ttgggtgaacacttcgagaggttgggg 
tactttcacgtcagtccatisgtattacgtccgt^ggtggctaccgtc*t 
cgccatqtttogatgtgcagtcggactcctctxcaataccgattccatcc 

oagcacacatgctcgcgscgst^ 

gc^ttcgtcggac^tgactgtggtc^oatgtcgggggggactcatgcccg; 

TGATCATATCGATGTACCTA&fi^^ 

GGATTTCGGTAGCOTGlOTS^ 

CCAAATAGTGTTGATTGTGACOT^ 

GTTGCATGAjGCACATGTTTAGGTCGTTG'TTTAACAAGTATCATGGGAGGG 
TGATGGiySTTIXsATMGCTGGCSCGTAATGTCT'r'rGTG C CAfTTTCAACfiC 

ttotggtactatcccaeak^^ 

gatcctctagattggatttgctggcgttc^tcgtgttcttctgttggtt 
mcggtgctggtgtcmgcatcccga0ctoggcgga3gcgtatcgc&ttcg 
tcttcgtgagacatgctgtacctgggttactgcatg'tgcaatcagctgtc 

cttogcgaagcccsgttctgggcttxtgccacattggcgtcccgggtgca 



partial ^ino acid sequence deduced ftfom the genomic cDKTA sequence 

S&TTRMi&SR RKFMCKYBPS Q£S£HHtfP2'D A^LIYKSQVI. 
DISKWISHKP GGEOTIjJUAFA * &HDA7X>BXjRA EFHDEWVLSs^ IiPHFVTGEVD WTUTXGG&EKTZ? 
VtSKOSQVSEIi IKDKRBI^Sqt FPRWOTHVS PWY1TVS.KVAT VFAIFGCfiXG LLFSKTDSIPA 
HMLS&VIAGI SWQQFAFVGH .DQGHMSfrRTSI .HRDHXDVPKIj GftliTOFOTGX SVAWtfKkTHU 

ftftlaHEHMFTS LFMKVIJ&KTM HRDWIARHVF 
IM&VAHFCJIiY XQjSTOtFlABK OTGXAG&XOS SRIiDIitAMR. VIiLliVJSGft0V MHPELSG&VB 
IRLHQXCCTW^ VTACRZTGKL t^snappTBS PGWGALSEM- FJ^ATCASS. V- 



no. on 



P. 42 



- 10094365. - ; ;"7->-eb- 04 03 i 
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CTOTGACCACGGGAGCAGCCGtGGCGAGCCCGCGTCT^^ 
TTTTTGGAGCAtfCCCTCACCACAC^ 

<3CCCGACTCTTCATA&CATC^ 

C^TGTTCCACCGAGTCGTCA^ 

OCMTTTCTCCTCGCCCGGTCM^^ 

TCCACC^CTTCTCGTTTCTC^^ 

CAGTSTG2VAC2^CGAAAACAAGG 

CTOOTTCOGAATGCAATGAGGAAGCra 

GATGACG&GGAATGCCGTACTACTTTGGATGTK^^ 

GTACGATeTTTCAGGTGAGTGCAAGT^^ 

TATTCTOTATCGGC&^ 
COkTTTGTTCCTCCTACTTCCTC^ 

AececACTGGATeGAcro^ 

TCTGCC CCCGAAACGCCTGCCCjTT C TTGA^CA-TCTGCAGCACCTTACXC 
GC^ACGOAfiCTTA&CTITAGG^OTTGA^ 

GCATCGTTGGTTACAGGAGCAGCaTTGAGTGCGG^GAGTGCTCCTCCTCT 

TTCAACTTTCTTGTT<^GATTGTC^ 

jkTGATTATATTCATOGTGTTGAT^^ 

GC7GCCGTGGCTGCTCKOTTOOT 

CS^VSCATGMGCTOT^ 

GTTTAAAGATTG^TG^TACCATCGTACTCAfATCCTCAACATTCTTTC 

j^tcgcaacagccaacgaam 

c atttctctttcctta^ gtcccggatccaaagtacgattctc cac txcgt 
aagatcc^cactacatcttct^ 

CTGGCGCGTGKSMMCCTOAaBG-TCGCCGT 
GTCCGTOTGCa^GfeTGAATO 

ttgtt&accttcttcccag 

tggcctcatgtctgcactc^ 

attttgaggagtatc^gcc^ 

ag^y^^^tgtcacgactaatcgattctctgagtggctttggggasgaa^ 
gcaotaccagttggagcatgacttgttcccttccatggcc?xggtaagcag 
cttaatgtttgtatcttgtacgattgttgacttptcgttctcggctaach 
ctgttgga^totatgagcctagcagata^tggtgtGtatgcg^ccati^ 
actcgatttamkttcaaatacct^^ 

TGACGTGTGTCCC ' \ m 

Figure 7b 

QPFVPPTSSG WRKAHPAGPH , WIDW^t>GSI3A TE^TDAFHTQ KGREMYXRLP ASAPETAAVL 

EA&AAPYSQT BLKfFRKXiKDQ LESEGWWEHD E^HEGKLLAI WASLVTOA&L TAESAPPLST 

FLXiGliSMTHA GWLGHEYIHG "TOKFS QiVMRS* fAAVAAGLGP TWSDKE5NKH HALSESDSCC 
YCKCGXtKlBS IPSYSYPQHS B^Ril^ORNGS 

RX^QYHRTHI LITCLSIATZtftf BMGTOEDIAT DSF&FEYVPD PKYDSPLRKX QJTCXFYSPFS 
PXiFALWRVDT LKVAVESYES KRHDAKME^W YLLAETYFVLL TFFPAOWVP AVFLSG1MSA 

LXVTPTH&SE EYFEEYQPEW VTAQPESTRlsr, AVTT3W3F6BW XWGGMQYQLB HHLFPSMPR 
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Pierage 

A&AAAASAAAAAAH^^ 

TTTCAAAAGTCAACTTGAAC^^ 

ACGTGGTGMCAAA^CAeTOC^CAGGC^^ 

ATATTTG I3CAAATO^ 

CTC^CCGGCGCGGCTTCGC6TCC<KKJTCG<5CCCGCTTGCATCCTTTCTTC 
CTCTCAC^TTTATCATCCAACG&GC^^^ 
CAACAGAATAkAACATACTAATC^ 
CGCUkGCCUM2CAAGCGTAGTGG^ 
A&TACACCTGGCAGGASG^GAAGAAGCACGTGM^ 

TGCCGTTTGG^TGTCCTTGTCGTTGGTTCGGA^TATGCS^CGAG^GTTCGT 

ATTOCAACTC^TTTCAATTO^ 
CAACTTCTGCCAGCGSCc^CiCC^TTCAG^^^ 
TACTCOVCCAAAACAAAGTCTACGACGTC^ 

GG^GGAGCGGTGGT<STTCACCi^CGCCGGaG^C{^C^TGACGGAG&TGT i r 

cgccgccttccacoccc5aggctgtcaggccatgatgaagaagttttaca 
ttggagatttgattccgg^agto^ 

gatttcgagaagggatatcgtoatttacgg<?cca^cttgtcatgatggg 
gatgttcaagtcgastx^gatgtattatgcatacaagtggtcgttcaata 

TGTGC^TGTGGTTGGTGGCGGTGGCCA^^ 

GCAATGCACaTTGGATCGGCTCTCTrGTTGGGArraG^TCTGGCAGCAGTO 
TGGATGGCTTGGGCACG^TXTCTTCACCACC^GTCTTT^GCAACGAA 
AGTACG£^GATCTCGTTGGCATCTTTTGGGGAGATCTGATGCAGGGGTTC 
TCGATGCAGTGGTGGAAGAAGAAGCACAATGGCCACCATGCTGTTCGCAA 

cttgcacaiv.ctcttccttggacagtcaggatsgtgatcccgatattgata 
cgatgccactccttgcttggagtctcaagcaggctca<^gct - 

ATC3ATAACXK3AftAi^ 

c^cattcacat^cttccccatcctcctctl's&ctggcatctpttggttga 

atgaatccttcaaaactggattggsactcggaggtgccxcggagaatggc 

aagttggagttggaga^gcgtggacxtcagtacccacttttggagaagct 

tggaatcacccttcattacacttggatgttc^^ 

gaaggtggtctcttccatattccatca^ 

tcctcgggacttttcctcgcattggtctttggattgggacacaaoggtat 
gt<^gtgtaggatgccaccacccgacctgacttctggcaactccar^tca 

CCaCTAeACGTAACATCATTGGT<M^ 

GATTGGTTCTGCGGTGGATTGCAOTAqCA?kGTGGATGACCACCTCTTCGC 
CATGATGCCTAGAAACAATATGGCGAAATSCCACA?y3CTTGTGGAGTCAT 
TCTGTAAGGAGTGGGSTGTG^GTACC^GAGGCCGATATGTGGGATGGT 
ACCGTGGAAGTGTTGC^CATCTCTCCAAGGTGTGGGATGATTTCGTTGT 
GGAGATGGTGA^GGATTTCCGTGCGATGT^AACACGTATTACCAG'XCGGC 

ATTATTAATGAACA&TOTTACCeTGCAGTCGTG^GTTTGCCTTCGT'TGT 
CCC ACCCCTTeTATTGTS^TTG^^ 

T CTACATCAGATCTCiCCATTCACQCTCGAATAGTATG CC&ACAACCATG 
ACATCA^CTACTTGAATCTCeTeT&TC 

TACTCTCAAAGGTATATCTATTTGtCCCTTTATTAATTGTTOAA^ATTGA 
AGGGGAA(^TTCCATT^TCCCCTCTCTCTTCCCCGATGATCCTCTCACCT 

CTAAATACGTTTCAC^CACAACAAC<^^ 
ACATGGCAGAACTAI^^ 

TTGGTAGTCCATCTGCAACGCATGTdTGGCTCCAGACCCTCATCCTGAlyS 
AGTGAGTTGTGATGTCGCTGAT&TACTTTCCGTCTOGATGTXGTC'TGAfiG^ 

TGTCM^CTCAGGGTGACC^^ " 
AAGATCCGATTTCCGATCCC^^ 

CTTGAA^CTCGAGGGTGACGAAT^XTGGGGGCGG^GK " 
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Figure Sb 

GCT (Alanine) codon add by us to improve yeast expression 

ATGGClOSajUWGGaG^ 
GCGSai^CCeCAGAAGTACaC^^ 
G CCTCGSIAGTCCACCAAAAC^^SS? CTAC G ACOTCTCCAACTGGT&G 6ACCACC CCGGT 

G^araTTCTGTOCACC^ 

nAATATCTOC^-TO'JG GTTS<STG<3 C 5 QTG G CGATG GTGT ACT Ji Cr CGGft. CAG TT TG G CA . 

TTTTGGGG^AT CT G&TG C&GCGGTTCT CGATGCAST GG T GG AAGAACAAS CAt^UVT G<5C 
GAG C ATG CTGT TCCC^M^TTG CACAACT C TTCCF T G SACAGf £aO<S AIXjGT GAT CCC GAT 
- ATT O^Ti^CCATG CG ACT CCTTG K^G^GT CTCA. 5 ^ CAS GCTF CACSAGT XT CAGARAGATC 
AAT AAGG G AAASaAC AGT AC CT TC GT CAAGTACGCTAT ElAAACTC GAGG CATT CACATAC 
tfTCGCGATCCTCaTCOTGGCTCG*^ 

GGATTTGGAAGGTGt^CTCTOCCOTAT^ . 
TCCHSGACTTTTCCTCGCATTGGT^ 

GCCAGCACGCGACCTGACTW^^ ,i 

GATC^CCACCECTECCeeATGATGCCTA^ ■ 
G&GTC^EITCT'GWAAGGAGTG^ • 



GATrrCCCXGCCS.TGTAA 

Figure ac 

Amino acid setpienae deduced from the cDHA sequence 

MASKSGDlAAA. ATB3LSGAL3X AlSKPQKYTEtfQ 5ktt^O.FlKVY PVSNU3YDHSG 

GAWFTHAGD DMEDIFAAFH AOG$G*MrcfcK ITOCJIHOTHfl VBHKDQRQLD FSXGTtRDIiRA 
KEWMMGMFKS SKHrWttfKCS FMMCMWIiVAV AKVySS».StA WSISSA&LiIjG LFWQQCGW1A 
HDFXiEBtftVFK QRKYGDIiVGI FWGDLMQSFS KQWHKNKHPQ HHAVENfcHHS SLDSQPGDFD 
XDOTSFJJfcAWS IjKQAQSPREr MKSKDSTTOK' Y&IKFQAFTY FEELM**!* W&JStBOTKT&F 
GliGAASBNAK. LHUSK»Gt^ PIifcEKLGITL HTfTHMFVLSS SfSRWSXjPYS IMYFFTATCS 
BGULALVFG ATOR5DFHQL QVTT5!&Wl£G eHGEPQFFVD WGGGLQTQY 

DKHIJDMMTO EJNIAKCHKL.V BSFCKEWeTO *SHEftPMWD5T VBVLQHliSKV SDDFLVEMVX 
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Figure 9a 

TATGTCCACCCCCCCCTGGTTTGTCCACCTCTGTGTTGGATCTTGGSAGC 

CGGGTCTCGAGTTTGCGAGACCTCTCA&GCGGGC . 

GjCTGTGTTTGCT(?AT&CCTGA<^ 

CTTTTCGTAACTCCACC^GGTACGACTGACCT 

GAT^CCTCTATGGC&^GCCGM 

G<3-AT^JlO C A<^ATGGA.TAGGCGGGGGAT CTTCLk^T GGACAAGAGGA&.GAG 

i^AOAGTATGTCGTCGGGG^GGTGC^^AGOTTGAGAGAGAGGGGATGAC 

AGATTGTACATTATCTTOCCTCCA&^ 

ATTAGAATCTTACAT^ACGTGGA 

GAAGC^CCCC^TG^CCATTTCGT^ 

T^GXG^CCGACGACGACGAACAGAACCGCTGAATGXGGXTCGGCAACA 
ACAATAC^CCAATATGTGCAAC(MCAACCTCCCAGCATCCACCGCACAGG 
TC^GTCCmCGTCGA?kQCCCCAGCAGCAACATGAGCAXCGCACCATGTGC 
AAGTCCl^GCTCGCCCAAGACAACACGCCCAA^^OAGCATGGTGTGCCGT 
CaACTCCACTCCCGCGACGGACCCATCCCACTCC^CAA.GA2^ACAACACG 
CAC&CCT&OTCCTCGA^^ 

QACCTCATCCTCCTGGCTTCCGGCAAAGAGGCCTCGGTGCTGTTTGAAAC 

ATACCATCCACGTGGAGTTCCGACGTCT CTCATTC AAAAG CTGCAGATTG 

GAGTOATGGAGGAGGAGGCGTTTCGGGATTCGTTTTACAGTTGGAGTGAT 

TCTGACOTTTATACTGTGTTGAAGAGGAGGGTTGTGGAGCGGTTGGAGGA 

GA<KM<MTTGGCGAGGAGGGGATCGAAAGAGATTTGGATCAAGGGTTTGT 

TCTTGTTGGTTGGATTXTGGTAC'TGTTTGTACaAGATGTATACTACGTCG 

GATATTGATC^mCGGTATTGCCATTGCCTATTCTAT'rGGAATGGGAAC 

CTTTGCGGCACTCATGGGCM^^ 

C&TTCOCTC AGAAG AAGTTACTC^C^^ 

ATGATTGGTCKJGAG 

TCATCCAT^^ACGAATGTGTTGGATGGGGTGGAGGAGGAGAGGAAGGAGA 
G<5<3<5<M^GGATGTTGCTTTGGAAGAS^ 

AGAGAGAGAGAGAGTCTATTCGTGTSfeAGTCGTAGATGCATGTG'rGCGAT 
TGAGCGACACAACTCTAACGCi&M&C^TTC 

GAATCAGATGCAGACGTATTCTCCTCCTTCCCTCTC^TGAGAATGCArCC 
CCTGCATACAACCTCATGGTATCAT^ 

CCCTCTTTGC^TTGATGACACTTGCCAAAGTATXCCAACAGGATTTTGAA 
GTTGCCACATCCGGACGATTATATCATATTGATGCCA^TG'tAGGTTATGG 
TTCGGTATCKSA^TGTCATO^^ 

C^TATATGATGGGATTAGCAATCTACTTTCATGGAGTAGTGAGGGG^S'rT 
CSGATTGTTTGTTATTGGGCATTTGGCGTGTGGAGAGTTGTTGGCGACGAT 
GTTTATTGTGAATCAGGTCATTGAGGGTOTGAGTXATGGAACGARJSGATT 

tggttggtggtgggagtcatgtagatgagaagaa^attgtcaagccaacg 
actotattgggagatac^ccaatggaftaagactcggisa^gagggattgan. 

i^GCAACAGC&ATAfrCAAC^^ 

GCCTTCCATTCAACXSACTGGGCAGC^TCCAATGGGAGAGCTCCGTGAAT 
' TGGTCTCCAGGGTCA , rGGTTC , rGGAATGAGTTTTCTGGGGGACTCTCTCA 

TCAG"ATTOMC^ 

AT^TCC2%^ATGTTGTGGAGAGTACGTGTGCTGAGTACGGAGTTCCGTAT 
CAGAGTG2^AGTAATTTGTTTGTTGdTTATGGAAAGATGATTAGTCATTT 

G^GTTTOTGGGTAAAGC 

GTCGAGTTGtfCttC^TCTTTAA^^ 

ACAAGCTTTGTC^TACC^^ 

CTCCGTGAGGAjGTTTGACCTCACTCATTCTA^CTXI^AATGTCTCTTTXG 
CGCTGGTGMCTEGGAGGAATACa^ 

ATGC^GAGGAA^AGGATATAGTGTATGAGTCCGAAGAATCGATGACGCG 



TGAAGAGGTGGATACCTAAACATTTGAGCGGTCTTGGGASCAAACTTTAG 
-GAGAGA^G^^Ge^GA^GGG^A^ 



AGGAACAGCMCGAACAGAGCGGGG 



• 1-P094865 vf-Fes-lK ''QZzWl — 
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Figure 9b ' ~ - 1 * 5 - - - " -* 

MCBFG-MrFAST AQLKSTSKPQ QQHEHHTXSK SBXAOHOTPK SAWCAVHSTP ATDPSKSIOTK 
QK&HLVIiD XI DFASRHPGGD L2LLASGKX3& SVfcFETYKFR GVFTSIiIQKXi QXGVMEEEAF 

KDSFYSTCTOS DFYTVIrKKRV VERLEERSX^ RfcGSKEIWIK ALFLIiVGFWY OIiYKMyTTSD 
XDQYGIAXAY SIGMGTF^AF - 2GTCXQHDGM HGAFAQNKLL NKLAGWTXiDM XGA5AFTCTEL 

QHMiGHHPYT NVXJDGVEEER ECEftGEjPVkLE EKDQfTOEDDR EUBSLFV 

X^KKQHLYAP I? LP AiMTLAK VFQQDFEVAT SGEXOTXDAST VR^GSVWXJVM RFWAMKVXTM 
GYMMGLPIYF HGVLRGVGLF V3GHLACGBL XiATMFIVNHV IEOTSYGT&D XjVGGASHVDE 
KKTVKPTTVXj GDTPMEyTOB EMTCSffSHMBf" KKKGEKNSVP SVPFJTOWAAV QCQTSVMWSP 
GSWFWNBPSG GLSHQXEHHL FMICHTNYC KIQDWESTC AETOVFYQSE SNBFVAYGJO* 
ISHLKFLGKA KCE 
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Figure lQa 



MQQACTTTCTCTCCGGCGATCCT 

GTGGCAATGCTTCTACCCGCCG^ 

GCAATGGT2^CTC^TACCA^ 

TCGTTTC&GCATCCAGGAGG 

CGACGGi^CaGCTGTATTTGAGTCACAGCATCCeMCAtoCCTCGAAAGA 
ATCTACTTCAGATCCTCTCC^^ACGAOOT'TC.CGTCGAGTGAAGAGTCT 
GTTTGCTTGATOTCC^CCCTAXpACGA^CTCAATGGTGAATCTGGGTACGA 
T TGGAAGGAC^TTGAAAATGATGATTTC GT ATCTGAC CT ACGAGCTCtCG 
T^TTGAGC^CTTTTCTCCTCTCGCCAAGGAAAGGGGjASTXTCACTCGTT 

gagtcgtcgaaggcaacagctgagcggtggatggtggttctagtgctcct 
tgcgtcgttcttcgtca^catcccattatatttgagtggttggtggactt 
tcgttgtcgtc^ctccca'tcctcgcttggctggcggttgtcaattac'x'ats 
cacgatgctactcactttggattgmcj^gaactgqattttcsaatgctgg 
gctcccatatctcctcggtctcctatcgag'tccgtcaa , rgt(5statcatg 
atcacgtcattgkjac^tcacgcatacaccaac^ 

i3[A73COT!QCTCACH3CTO 

GAGACCATCTCACTTAAATCAAAGACAGCTTCCGCGGA'TTGTCTTCATCT 
GGTCGATTGCAOTCGGTATTGGGTTGAACTTACTGAACGACGTGAGAGCA 
CTA^CCAI^CTTTCATACAACAACGTTGTTCGGGTGGAGAAGATGTCATG 
GTCGCGAACATTACTCi^TTTCCTTGC^CGTATGTraCACifeTCTTTGTGA 

ctac^ctttggccctttttggggtttccggtotggaaggc^tggtttgg 
gcgactgtaccgaatgccatactgagtttgtgcttcatgctgaataggca 
aatcaatcacctc&tcaacacgtgtgca 
ac^ocatca&gttgtaact^ 

tccttcatcttctcgggaggtctgaactaccaaattGaacatgatttgtt 
gccgacggtc^aaccatogccatttgccagctttggccccgggtgtagagc 
gtttctgtaag&aac acggggtgacatac^^ctctgttopiaggatagaga 

GAGGCCATCATXGCACACTOXGCAC^TACCAAAGATATGTCGACGAAGCG 
TACTGATTGA 
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Figure lla tPpp^SO = Thala^BBiaaiga pperKtoggqaa, EESATUEASB Q 

AfclNTCTCCCACCCNGCCAGCTCTTTC,^ 

TTCCC^CC^CTEGGTCCTCCATACGATC 

TTGACTTCTTG&CGGAGGAGTGGGATGAA^ 

CTGATXTTCCTGAGAAGGAGAAGTCAGCUXSG^ 

TGTGGCA&CCCTATTGAGACGT^ 

AGCATGCAAAC^GAATGGTATAi^TCACTi^^ 

AGTAGGAATAGC^TGACGAGATGGTGTACAGATGTTA^ 
TTAAGCGAATGGCTGGATC5ATX^OGA07ATGCAATGCAAAACTGTATAGAT 
TCTTGCXAkT^GkCTTTOTAG^^ 
CTAATTAATATI&AAACCGA^^ 

GTATCKr^TTCACTACGTCGCTTGACAGGAAGCTCACGTGGCCTCGeclGA 
AGAAGAC^AACAAAACCGAGCCCTCACATTT^^ . 
TC^CACCACCAATAC^TGCCCCCCAACGCCGATATGTCGGGCIATCCGC 
AAGCGCATCCCC^CCM^AACAGGTACCGTTGCCTCTGG 

CCeCGCCACCCAATCCGTCCGAACCCTCS^^ 

TCGTCATCAAGGGCACAATTTATGAGATTGCTGACl'TXGTCCATCCTGGA 

ggagaggttgtcaagttctttogtgggaatgatgttactattcagtataa 
tatgattcatccgtatcatacggggaaac&tCtggagaagatgaaggcxg 
ttgg2usaggttgtagattggca^^ 

TTGACGTTGTTGTTGTGTCATTTCGTT^ 
CCTCTACACACCTCTTCGTTACGAT^^ 

aacgagagatgtkaatcagaagtgttc^gal'cgtacgxcgcgggggtoag 
ttcggcacaacaggctacttcctctctgccttottctacatcgcmtctt 

CTTCACCATGCAkTACACTTO 
ATCACTGGTATCAGAGTGGTGTATTC^^ 

CAGGCATTCATTGGGTTGSATGTGCAGCiACGATGCCAATGACGGAGCTGC 
CAGT AAGCGTCCCTGGGTG AATGACTXGTTGGGATTTGGAACGGATTXGA 
TTGG^TCTA^JCA^TGGZ^^ 
TAC^CTAACC&TAG^^ 

GTTTGCATTCAATGACTATCCCATTGGACA^ 

ATAGGTTCCSySGGASGGXACTTCCTCTTCATGCTTGGAGTTTACTGGCTC 
TGGACTGTATTC3VATCCGCAATTCATTGATCrrCGTCAACGTGGGGGTCA 
GTACGTCGGAATTGAAATGGAGAATGATrTCATTGTdAAGAGGAGGAfiX5T 
ACGCCGTTGCATTGAGGATGATGTACATTTACTTGAACATTGTC.2W5CCCC 
TTCATGAAGAATOGTTTGAGCTGGTCTACCTTTGGAATCATCATGTTGAT 

i^gg^tcagcgagagtgtgagtctcagtgtgctcttctc^ttgtgtcaca 

&CTTCATCAATTCGGATCGTGATGCTAC 

Gi3\ACAAGTGTGCTGGTTCAAGTCGCftj30TGGAGAGTTGGTCTAGCTATGG 
GGGTTTTATTTGCGGATGTCTTACGGGAGGAGTCAAGTTTGAGGXGGASi.C 
ATCATCTCTTTCCCCGTATG^GCAGTGCTTGGTATCCTTACATTGCACCT 

ac^t^cgtgaogtttggaagaagcacggggtga^ctacgcttattatcg 

TTOGATTGGGCAGiU^TTraGTAT 

GTAGTGGAGCCAAGTGGGAGCTC^AGCCGTTGTGTGGAAGTGCCTAAAGT 
TTAGTTGTACTGATTGTCGGAGGTGGTGCTGGTGCTTGA?1CTAAT(STTA3 

gagtgcatgct&&&aggcotctt 
tatcagcttcgatatgttgcm^ 
ju^tctagcata?u^tttc 

acgcj^tgcctctctcataagccggaaac^gmgaccagcttcatactc 
taatcgtccatgtttggcagctgcaatcc^gcgctagcagcaggtctctt 
actgaactccatcggagtca^cttcg^atctocgcccgcatc^atctcat 
gc^^ccgtgccctctctacca^ 

GCGATTCCACGTAGTACGTTOGCTCGCTC^^ 

tttgtattgtgatatactgtgctggtgc^^ 

GTTGljrGACTGTGCATTOGATGCTGCTGTGAAACAGTCGGTGCAGtGTAGG 
GGAGGTGGTGT0 f TCTGAACTGAGGAGATGCCCGCA?iACTGA , £ I Al5GGGGTG 
GTGCAGCGCTATAAATTTTGCGAGCGAGTCCATTGTCCTTGCTCTCCCCA 
TATGTCGGGCGAGGGCGl^GCGCGAAGGAGAAGGC^CAAGGCCAATACAA 

-TACAAC 
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Figure xXh 

OCT (Alanine) codon add. by ns to improve yeast aacpa^esioa 

ATGSCTTCCCCCAACGCCGjOTATCTCC^ 

aCCGTTGCCTCTGCCGACA^ 

CTC^GGGC^CGAGGTCGTCATC^ 

ATTC&TCOTTATCATACGGGG^^ 

GATTGGC&GTCGSRCTACAAOTTCGACACCCCCT^ 

TTCAAGATCGTACOTCGCG 

TTC3?AC^TC<3CTCTCTTCT 

ACCTACGATC&CTCffiTATCA 

OC&TTCMTGGGTTGAATGTCC^ 

TGGGK^TGACTTCTTC^ 

ATGGC^O^CATTGGACTCM 

TTCAGCTCTOAACCm^ 

Ci^ATGC^^^TCAT'TT CATTG T CAAGAGGAGGAAGTACG C C GTTGCATTGAGGATG A.TG 

TACATTTACTTG^AACATTGTGAGCCCGTTCATG7^ACA3\TGGTTTGAGCTGGTCTACCTT , r 

<^AATCATCATCTraATGGG^TCAGCG^ 

TCTCACAACTTCATCAMTCGGATCGl^ 

CSMTOTGOTTOOT 

GGATGTCTTACGGGAG<^CTC^ 

AfSTGCTTOOTMCCTTM 

AGCTACGCTTATTATCCTTG^T^^ 

COCGCTGGTAGTGGAGCGA^CTGGGAGGTC^GCCGXTGTCXGGAAGTaCCXAA 
Figure- lie Affliao acid sequence deduced from cDKTA sequence 

M&P£M&Dr$R fcRKmSJPTKEG TVSfiftDNHBF ATQSVRTlx&S IrKCNBWIIirQ TIYDIADFVH 
PGGEOTKFFG GHffVTIQYHM IKE^nSTGBm EKMKXWBKUV DWQSlD3>^T JPFERBIKSBV 
PKI^RHSESP GETOYFLFAF ^YTAtPPTHQ OTfttoCOTBT? TVDM^QSGV FIAXVFGISQ 
aFIGLTSFVQHD ANHSAASKRP WVKDU^SFG* EMOS^WPW MAQHWEHHATC TNKSEKDPpS 
FSSEEKFaFH DYPIGHSKRK MWHE^QGGYP t'PMLBti'SWLP TVFNPQFIGt, K^RSAQYVQI 
QHENDFIVKR RKYAVApSLMtf YIYLNIVEFF MHKGLStfSTF GEiM&MGXSE SI/HuSVIiFSIj 
■SHMPEtiSDRD PTAPFKKTQB QVCWFKSQVE TSSTOSffiFIS GCLTGGLWFQ VEHHLFPEMS 
S&WtfPYX&PT SYATCFWIGO WI.VS'tFKiMH RSGSGSftrWEfc KPIiSG-SA 
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